An Efficient Block-based Dynamic Range Adjustment Method in Noise-robust Continuous Speech Recognition

Authors

Yiming SUN

Yoshikazu MIYANAGA

Abstract

This paper proposes a new technique for estimating speech features in noisy conditions, with the aim of noise-robust continuous speech recognition (CSR). Noise-robust techniques for isolated word speech recognition typically employ running spectrum analysis (RSA), running spectrum filtering (RSF) and dynamic range adjustment (DRA). Of these, only RSA has previously been applied to a CSR system. We propose an enhanced DRA for noise-robust CSR: in the recognition stage, the continuous speech waveform is automatically divided into short blocks and DRA is applied to each block. The proposed method improves recognition performance under several different noise types and SNR conditions.
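The block-wise idea in the abstract can be illustrated with a minimal sketch. It rests on assumptions the abstract does not confirm: DRA is taken to mean scaling each cepstral coefficient trajectory by its peak absolute value, and the blocks are taken to be fixed-length runs of frames, whereas the paper only says the waveform is divided automatically. The names `dra_block`, `block_dra` and `block_len` are hypothetical.

```python
def dra_block(block):
    """Scale each coefficient trajectory in one block to a peak magnitude of 1.

    `block` is a list of frames; each frame is a list of cepstral coefficients.
    Assumption: DRA divides each trajectory by its maximum absolute value.
    """
    if not block:
        return []
    n_coeffs = len(block[0])
    # Peak absolute value of each coefficient over the frames in this block.
    peaks = [max(abs(frame[i]) for frame in block) for i in range(n_coeffs)]
    # An all-zero trajectory is left at zero to avoid dividing by zero.
    return [
        [frame[i] / peaks[i] if peaks[i] > 0 else 0.0 for i in range(n_coeffs)]
        for frame in block
    ]


def block_dra(frames, block_len):
    """Apply DRA independently to consecutive blocks of `block_len` frames.

    Assumption: fixed-length blocks stand in for the paper's automatic division.
    """
    if block_len <= 0:
        raise ValueError("block_len must be positive")
    out = []
    for start in range(0, len(frames), block_len):
        out.extend(dra_block(frames[start:start + block_len]))
    return out


# Four frames of two coefficients each, normalized in blocks of two frames.
frames = [[2.0, -0.5], [-4.0, 0.25], [1.0, 1.0], [0.5, -2.0]]
normalized = block_dra(frames, block_len=2)
```

Because each block is normalized on its own, the scaling adapts to local changes in level along a continuous utterance instead of relying on a single peak for the whole recording.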

Similar resources

A Noise-Robust Continuous Speech Recognition System Using Block-Based Dynamic Range Adjustment

SUMMARY: A new approach to speech feature estimation in noisy conditions is proposed in this paper. It is used in noise-robust continuous speech recognition (CSR). As noise-robust techniques for isolated word speech recognition, running spectrum analysis (RSA), running spectrum filtering (RSF) and dynamic range adjustment (DRA) have been developed. Among them, only R...

Improving the performance of MFCC for Persian robust speech recognition

The Mel frequency cepstral coefficients (MFCCs) are the most widely used features in speech recognition, but they are very sensitive to noise. In this paper, to achieve satisfactory performance in Automatic Speech Recognition (ASR) applications, we introduce a new noise-robust set of MFCC vectors estimated through the following steps. First, spectral mean normalization is a pre-processing which applies to t...

Robust Speech Recognition with MSC/DRA Feature Extraction on Modulation Spectrum Domain

This report introduces noise-robust speech recognition and proposes advanced speech analysis techniques named MSC (Modulation Spectrum Control)/DRA (Dynamic Range Adjustment). The dynamic range of the cepstrum obtained from noisy speech is usually smaller than that obtained from the same speech without noise, since some speech features are hidden in the noise. This difference may cause recognition errors. Theref...

A New Method for Robust Speech Recognition Based on Missing Data Using a Bidirectional Neural Network

The performance of speech recognition systems is greatly reduced when speech is corrupted by noise. One common approach to robust speech recognition is the missing-feature method. In this approach, the components of the signal's time-frequency representation (spectrogram) that have a low signal-to-noise ratio (SNR) are tagged as missing, deleted, and then replaced by the remaining components and statistical ...

Compression of Model-based Group Delay Function for Robust Speech Recognition

In this paper, we improve the performance of the ARGDMF [3] feature by adding a nonlinear filtering block. ARGDMF is a group delay-based feature consisting of four main parts, namely autoregressive (AR) model extraction, group delay function (GDF) calculation, compression, and scale information augmentation. The main problem with the GDF is its spiky nature, which is solved by coupling the GDF wit...

Publication date: 2011
